Overview

Dataset statistics

Number of variables: 21
Number of observations: 1391
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 228.3 KiB
Average record size in memory: 168.1 B

Variable types

NUM: 18
CAT: 2
BOOL: 1

Warnings

Members is highly correlated with ScoredBy [High correlation]
ScoredBy is highly correlated with Members [High correlation]
EndingDate_year is highly correlated with StartingDate_year [High correlation]
StartingDate_year is highly correlated with EndingDate_year [High correlation]
normalizedScore is highly correlated with Score [High correlation]
Score is highly correlated with normalizedScore [High correlation]
Episodes is highly skewed (γ1 = 21.17581687) [Skewed]
df_index has unique values [Unique]
Type has 333 (23.9%) zeros [Zeros]
StartingSeason has 274 (19.7%) zeros [Zeros]
BroadcastTime has 32 (2.3%) zeros [Zeros]
Sources has 74 (5.3%) zeros [Zeros]
Rating has 115 (8.3%) zeros [Zeros]
BroadcastDay has 105 (7.5%) zeros [Zeros]

Reproduction

Analysis started: 2020-10-28 17:45:40.965995
Analysis finished: 2020-10-28 17:46:57.618909
Duration: 1 minute and 16.65 seconds
Software version: pandas-profiling v2.9.0
Download configuration: config.yaml

Variables

df_index
Real number (ℝ≥0)

UNIQUE

Distinct: 1391
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 769.0366643
Minimum: 0
Maximum: 1562
Zeros: 1
Zeros (%): 0.1%
Memory size: 10.9 KiB

Quantile statistics

Minimum: 0
5-th percentile: 72.5
Q1: 380.5
Median: 762
Q3: 1155.5
95-th percentile: 1484.5
Maximum: 1562
Range: 1562
Interquartile range (IQR): 775

Descriptive statistics

Standard deviation: 450.6973104
Coefficient of variation (CV): 0.5860543863
Kurtosis: -1.179967244
Mean: 769.0366643
Median Absolute Deviation (MAD): 388
Skewness: 0.03402664682
Sum: 1069730
Variance: 203128.0656
Monotonicity: Strictly increasing
Histogram with fixed size bins (bins=50)
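Value | Count | Frequency (%)
1562 | 1 | 0.1%
513 | 1 | 0.1%
504 | 1 | 0.1%
505 | 1 | 0.1%
506 | 1 | 0.1%
507 | 1 | 0.1%
508 | 1 | 0.1%
509 | 1 | 0.1%
511 | 1 | 0.1%
512 | 1 | 0.1%
Other values (1381) | 1381 | 99.3%

Value | Count | Frequency (%)
0 | 1 | 0.1%
1 | 1 | 0.1%
2 | 1 | 0.1%
4 | 1 | 0.1%
5 | 1 | 0.1%

Value | Count | Frequency (%)
1562 | 1 | 0.1%
1561 | 1 | 0.1%
1560 | 1 | 0.1%
1559 | 1 | 0.1%
1558 | 1 | 0.1%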

Type
Real number (ℝ≥0)

ZEROS

Distinct: 5
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.44356578
Minimum: 0
Maximum: 5
Zeros: 333
Zeros (%): 23.9%
Memory size: 10.9 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 2
Median: 5
Q3: 5
95-th percentile: 5
Maximum: 5
Range: 5
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 2.075086878
Coefficient of variation (CV): 0.6025982978
Kurtosis: -0.9947110193
Mean: 3.44356578
Median Absolute Deviation (MAD): 0
Skewness: -0.8615442008
Sum: 4790
Variance: 4.305985549
Monotonicity: Not monotonic
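
As a rough cross-check, the Type statistics above can be recomputed from the value counts this report lists for Type (5: 796, 0: 333, 3: 184, 4: 51, 2: 27). The sketch below is not the profiler's own code: the helper name `describe_numeric` is hypothetical, and it assumes the sample (ddof=1) definitions of standard deviation and variance, which is what the reported Type figures appear to be consistent with. The row order of the rebuilt column is arbitrary, so only order-independent statistics are compared.

```python
import numpy as np
import pandas as pd


def describe_numeric(series: pd.Series) -> dict:
    """Recompute a few descriptive statistics for one numeric column."""
    mean = series.mean()
    std = series.std()        # pandas default: sample standard deviation (ddof=1)
    median = series.median()
    q1, q3 = series.quantile([0.25, 0.75])
    return {
        "mean": mean,
        "std": std,
        "variance": series.var(),                  # sample variance (ddof=1)
        "cv": std / mean,                          # coefficient of variation
        "mad": (series - median).abs().median(),   # median absolute deviation
        "iqr": q3 - q1,                            # interquartile range
        "range": series.max() - series.min(),
    }


# Rebuild a column with the same value counts as Type (row order is arbitrary).
type_column = pd.Series(np.repeat([5, 0, 3, 4, 2], [796, 333, 184, 51, 27]))
stats = describe_numeric(type_column)
print(stats)
```

Under these assumptions the mean, standard deviation, variance, CV, MAD, IQR and range should agree with the figures listed for Type to the displayed precision.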
Histogram with fixed size bins (bins=5)
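Value | Count | Frequency (%)
5 | 796 | 57.2%
0 | 333 | 23.9%
3 | 184 | 13.2%
4 | 51 | 3.7%
2 | 27 | 1.9%

Value | Count | Frequency (%)
0 | 333 | 23.9%
2 | 27 | 1.9%
3 | 184 | 13.2%
4 | 51 | 3.7%
5 | 796 | 57.2%

Value | Count | Frequency (%)
5 | 796 | 57.2%
4 | 51 | 3.7%
3 | 184 | 13.2%
2 | 27 | 1.9%
0 | 333 | 23.9%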

Episodes
Real number (ℝ≥0)

SKEWED

Distinct: 106
Distinct (%): 7.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20.64414091
Minimum: 1
Maximum: 1787
Zeros: 0
Zeros (%): 0.0%
Memory size: 10.9 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
Median: 12
Q3: 25
95-th percentile: 59
Maximum: 1787
Range: 1786
Interquartile range (IQR): 24

Descriptive statistics

Standard deviation: 58.37256211
Coefficient of variation (CV): 2.827560729
Kurtosis: 609.8407683
Mean: 20.64414091
Median Absolute Deviation (MAD): 11
Skewness: 21.17581687
Sum: 28716
Variance: 3407.356007
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
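Value | Count | Frequency (%)
1 | 401 | 28.8%
12 | 176 | 12.7%
13 | 135 | 9.7%
26 | 105 | 7.5%
25 | 64 | 4.6%
24 | 63 | 4.5%
2 | 53 | 3.8%
3 | 29 | 2.1%
4 | 25 | 1.8%
11 | 25 | 1.8%
Other values (96) | 315 | 22.6%

Value | Count | Frequency (%)
1 | 401 | 28.8%
2 | 53 | 3.8%
3 | 29 | 2.1%
4 | 25 | 1.8%
5 | 7 | 0.5%

Value | Count | Frequency (%)
1787 | 1 | 0.1%
500 | 1 | 0.1%
373 | 1 | 0.1%
366 | 1 | 0.1%
358 | 1 | 0.1%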

Status
Boolean

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 10.9 KiB

Value | Count | Frequency (%)
1 | 1386 | 99.6%
0 | 5 | 0.4%

StartingSeason
Real number (ℝ≥0)

ZEROS

Distinct: 5
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.051761323
Minimum: 0
Maximum: 4
Zeros: 274
Zeros (%): 19.7%
Memory size: 10.9 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 1
Median: 3
Q3: 3
95-th percentile: 4
Maximum: 4
Range: 4
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 1.340104469
Coefficient of variation (CV): 0.6531483241
Kurtosis: -1.292893389
Mean: 2.051761323
Median Absolute Deviation (MAD): 1
Skewness: -0.3370269752
Sum: 2854
Variance: 1.795879989
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=5)
Value | Count | Frequency (%)
3 | 595 | 42.8%
0 | 274 | 19.7%
1 | 253 | 18.2%
4 | 139 | 10.0%
2 | 130 | 9.3%

Value | Count | Frequency (%)
0 | 274 | 19.7%
1 | 253 | 18.2%
2 | 130 | 9.3%
3 | 595 | 42.8%
4 | 139 | 10.0%

Value | Count | Frequency (%)
4 | 139 | 10.0%
3 | 595 | 42.8%
2 | 130 | 9.3%
1 | 253 | 18.2%
0 | 274 | 19.7%

BroadcastTime
Real number (ℝ≥0)

ZEROS

Distinct: 81
Distinct (%): 5.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 66.11358735
Minimum: 0
Maximum: 81
Zeros: 32
Zeros (%): 2.3%
Memory size: 10.9 KiB

Quantile statistics

Minimum: 0
5-th percentile: 4
Q1: 63
Median: 81
Q3: 81
95-th percentile: 81
Maximum: 81
Range: 81
Interquartile range (IQR): 18

Descriptive statistics

Standard deviation: 25.91222113
Coefficient of variation (CV): 0.3919348831
Kurtosis: 0.7870356258
Mean: 66.11358735
Median Absolute Deviation (MAD): 0
Skewness: -1.547752944
Sum: 91964
Variance: 671.4432037
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
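Value | Count | Frequency (%)
81 | 895 | 64.3%
63 | 34 | 2.4%
0 | 32 | 2.3%
68 | 26 | 1.9%
18 | 25 | 1.8%
67 | 21 | 1.5%
2 | 21 | 1.5%
79 | 18 | 1.3%
19 | 16 | 1.2%
65 | 16 | 1.2%
Other values (71) | 287 | 20.6%

Value | Count | Frequency (%)
0 | 32 | 2.3%
1 | 1 | 0.1%
2 | 21 | 1.5%
3 | 2 | 0.1%
4 | 15 | 1.1%

Value | Count | Frequency (%)
81 | 895 | 64.3%
80 | 1 | 0.1%
79 | 18 | 1.3%
78 | 1 | 0.1%
77 | 1 | 0.1%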

Sources
Real number (ℝ≥0)

ZEROS

Distinct: 14
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.310567937
Minimum: 0
Maximum: 13
Zeros: 74
Zeros (%): 5.3%
Memory size: 10.9 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 6
Median: 6
Q3: 8
95-th percentile: 10
Maximum: 13
Range: 13
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 2.590218825
Coefficient of variation (CV): 0.4104573235
Kurtosis: 1.155437527
Mean: 6.310567937
Median Absolute Deviation (MAD): 1
Skewness: -0.3042376348
Sum: 8778
Variance: 6.709233562
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=14)
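Value | Count | Frequency (%)
6 | 693 | 49.8%
9 | 254 | 18.3%
5 | 131 | 9.4%
8 | 74 | 5.3%
0 | 74 | 5.3%
1 | 46 | 3.3%
4 | 36 | 2.6%
12 | 32 | 2.3%
13 | 25 | 1.8%
10 | 14 | 1.0%
Other values (4) | 12 | 0.9%

Value | Count | Frequency (%)
0 | 74 | 5.3%
1 | 46 | 3.3%
2 | 8 | 0.6%
3 | 1 | 0.1%
4 | 36 | 2.6%

Value | Count | Frequency (%)
13 | 25 | 1.8%
12 | 32 | 2.3%
11 | 2 | 0.1%
10 | 14 | 1.0%
9 | 254 | 18.3%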

Duration
Real number (ℝ≥0)

Distinct: 119
Distinct (%): 8.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1267.529835
Minimum: 0
Maximum: 3776
Zeros: 3
Zeros (%): 0.2%
Memory size: 10.9 KiB

Quantile statistics

Minimum: 0
5-th percentile: 89
Q1: 896
Median: 1536
Q3: 1536
95-th percentile: 2336
Maximum: 3776
Range: 3776
Interquartile range (IQR): 640

Descriptive statistics

Standard deviation: 740.4823931
Coefficient of variation (CV): 0.5841932654
Kurtosis: 0.6915012534
Mean: 1267.529835
Median Absolute Deviation (MAD): 64
Skewness: -0.005675816709
Sum: 1763134
Variance: 548314.1745
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1536 | 425 | 30.6%
1472 | 248 | 17.8%
1600 | 141 | 10.1%
1408 | 40 | 2.9%
1920 | 23 | 1.7%
1664 | 22 | 1.6%
320 | 16 | 1.2%
90 | 16 | 1.2%
100 | 14 | 1.0%
128 | 14 | 1.0%
Other values (109) | 432 | 31.1%

Value | Count | Frequency (%)
0 | 3 | 0.2%
61 | 5 | 0.4%
62 | 2 | 0.1%
63 | 2 | 0.1%
64 | 13 | 0.9%

Value | Count | Frequency (%)
3776 | 5 | 0.4%
3712 | 2 | 0.1%
3648 | 1 | 0.1%
3584 | 1 | 0.1%
3520 | 1 | 0.1%

Rating
Real number (ℝ≥0)

ZEROS

Distinct: 5
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.914450036
Minimum: 0
Maximum: 4
Zeros: 115
Zeros (%): 8.3%
Memory size: 10.9 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 3
Median: 3
Q3: 3
95-th percentile: 4
Maximum: 4
Range: 4
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 1.002450076
Coefficient of variation (CV): 0.3439585731
Kurtosis: 3.306394395
Mean: 2.914450036
Median Absolute Deviation (MAD): 0
Skewness: -1.827766584
Sum: 4054
Variance: 1.004906154
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=5)
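Value | Count | Frequency (%)
3 | 922 | 66.3%
4 | 293 | 21.1%
0 | 115 | 8.3%
2 | 55 | 4.0%
1 | 6 | 0.4%

Value | Count | Frequency (%)
0 | 115 | 8.3%
1 | 6 | 0.4%
2 | 55 | 4.0%
3 | 922 | 66.3%
4 | 293 | 21.1%

Value | Count | Frequency (%)
4 | 293 | 21.1%
3 | 922 | 66.3%
2 | 55 | 4.0%
1 | 6 | 0.4%
0 | 115 | 8.3%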

Score
Categorical

HIGH CORRELATION

Distinct: 3
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 10.9 KiB

Value | Count | Frequency (%)
7 | 923 | 66.4%
8 | 453 | 32.6%
9 | 15 | 1.1%

Frequencies of value counts

Unique

Unique: 0
Unique (%): 0.0%

Histogram of lengths of the category

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

ScoredBy
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 1384
Distinct (%): 99.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 65447.0115
Minimum: 144
Maximum: 993775
Zeros: 0
Zeros (%): 0.0%
Memory size: 10.9 KiB

Quantile statistics

Minimum: 144
5-th percentile: 1466.5
Q1: 7771
Median: 27205
Q3: 74694.5
95-th percentile: 278476.5
Maximum: 993775
Range: 993631
Interquartile range (IQR): 66923.5

Descriptive statistics

Standard deviation: 103898.727
Coefficient of variation (CV): 1.587524389
Kurtosis: 17.6988829
Mean: 65447.0115
Median Absolute Deviation (MAD): 23210
Skewness: 3.556528143
Sum: 91036793
Variance: 1.079494547e+10
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
2980 | 2 | 0.1%
14665 | 2 | 0.1%
5319 | 2 | 0.1%
2900 | 2 | 0.1%
42869 | 2 | 0.1%
2232 | 2 | 0.1%
1178 | 2 | 0.1%
8868 | 1 | 0.1%
46974 | 1 | 0.1%
2707 | 1 | 0.1%
Other values (1374) | 1374 | 98.8%

Value | Count | Frequency (%)
144 | 1 | 0.1%
190 | 1 | 0.1%
236 | 1 | 0.1%
290 | 1 | 0.1%
297 | 1 | 0.1%

Value | Count | Frequency (%)
993775 | 1 | 0.1%
925420 | 1 | 0.1%
902116 | 1 | 0.1%
719706 | 1 | 0.1%
675113 | 1 | 0.1%

Members
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 1382
Distinct (%): 99.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 124220.087
Minimum: 781
Maximum: 1432871
Zeros: 0
Zeros (%): 0.0%
Memory size: 10.9 KiB

Quantile statistics

Minimum: 781
5-th percentile: 4734
Q1: 19547
Median: 60379
Q3: 147930
95-th percentile: 490481.5
Maximum: 1432871
Range: 1432090
Interquartile range (IQR): 128383

Descriptive statistics

Standard deviation: 173211.2713
Coefficient of variation (CV): 1.394390195
Kurtosis: 10.80533956
Mean: 124220.087
Median Absolute Deviation (MAD): 48512
Skewness: 2.870009981
Sum: 172790141
Variance: 3.00021445e+10
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
10472 | 2 | 0.1%
10121 | 2 | 0.1%
17892 | 2 | 0.1%
11493 | 2 | 0.1%
11867 | 2 | 0.1%
7566 | 2 | 0.1%
20463 | 2 | 0.1%
33454 | 2 | 0.1%
143035 | 2 | 0.1%
86705 | 1 | 0.1%
Other values (1372) | 1372 | 98.6%

Value | Count | Frequency (%)
781 | 1 | 0.1%
1148 | 1 | 0.1%
1221 | 1 | 0.1%
1373 | 1 | 0.1%
1464 | 1 | 0.1%

Value | Count | Frequency (%)
1432871 | 1 | 0.1%
1322447 | 1 | 0.1%
1279860 | 1 | 0.1%
1176368 | 1 | 0.1%
996171 | 1 | 0.1%

Favorites
Real number (ℝ≥0)

Distinct: 899
Distinct (%): 64.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2521.710999
Minimum: 1
Maximum: 105387
Zeros: 0
Zeros (%): 0.0%
Memory size: 10.9 KiB

Quantile statistics

Minimum: 1
5-th percentile: 10
Q1: 67.5
Median: 351
Q3: 1487.5
95-th percentile: 11860
Maximum: 105387
Range: 105386
Interquartile range (IQR): 1420

Descriptive statistics

Standard deviation: 7549.117376
Coefficient of variation (CV): 2.993648907
Kurtosis: 64.69936288
Mean: 2521.710999
Median Absolute Deviation (MAD): 331
Skewness: 6.963697955
Sum: 3507700
Variance: 56989173.16
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
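Value | Count | Frequency (%)
14 | 12 | 0.9%
7 | 11 | 0.8%
21 | 11 | 0.8%
4 | 11 | 0.8%
15 | 11 | 0.8%
13 | 11 | 0.8%
20 | 10 | 0.7%
9 | 9 | 0.6%
5 | 8 | 0.6%
30 | 8 | 0.6%
Other values (889) | 1289 | 92.7%

Value | Count | Frequency (%)
1 | 2 | 0.1%
2 | 5 | 0.4%
3 | 7 | 0.5%
4 | 11 | 0.8%
5 | 8 | 0.6%

Value | Count | Frequency (%)
105387 | 1 | 0.1%
90365 | 1 | 0.1%
87872 | 1 | 0.1%
63324 | 1 | 0.1%
63317 | 1 | 0.1%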

StartingDate_year
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 47
Distinct (%): 3.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2007.849029
Minimum: 1970
Maximum: 2018
Zeros: 0
Zeros (%): 0.0%
Memory size: 10.9 KiB

Quantile statistics

Minimum: 1970
5-th percentile: 1988.5
Q1: 2005
Median: 2010
Q3: 2014
95-th percentile: 2017
Maximum: 2018
Range: 48
Interquartile range (IQR): 9

Descriptive statistics

Standard deviation: 8.757413969
Coefficient of variation (CV): 0.004361589861
Kurtosis: 2.062638308
Mean: 2007.849029
Median Absolute Deviation (MAD): 5
Skewness: -1.485787204
Sum: 2792918
Variance: 76.69229942
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=47)
Value | Count | Frequency (%)
2016 | 102 | 7.3%
2015 | 102 | 7.3%
2014 | 100 | 7.2%
2017 | 98 | 7.0%
2011 | 90 | 6.5%
2013 | 86 | 6.2%
2009 | 79 | 5.7%
2012 | 77 | 5.5%
2010 | 74 | 5.3%
2008 | 67 | 4.8%
Other values (37) | 516 | 37.1%

Value | Count | Frequency (%)
1970 | 1 | 0.1%
1971 | 1 | 0.1%
1974 | 1 | 0.1%
1975 | 1 | 0.1%
1976 | 1 | 0.1%

Value | Count | Frequency (%)
2018 | 19 | 1.4%
2017 | 98 | 7.0%
2016 | 102 | 7.3%
2015 | 102 | 7.3%
2014 | 100 | 7.2%

StartingDate_month
Real number (ℝ≥0)

Distinct: 12
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.3465133
Minimum: 1
Maximum: 12
Zeros: 0
Zeros (%): 0.0%
Memory size: 10.9 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 4
Median: 7
Q3: 10
95-th percentile: 12
Maximum: 12
Range: 11
Interquartile range (IQR): 6

Descriptive statistics

Standard deviation: 3.421365434
Coefficient of variation (CV): 0.5390937153
Kurtosis: -1.299437066
Mean: 6.3465133
Median Absolute Deviation (MAD): 3
Skewness: -0.02043946508
Sum: 8828
Variance: 11.70574143
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=12)
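Value | Count | Frequency (%)
4 | 301 | 21.6%
10 | 296 | 21.3%
7 | 176 | 12.7%
1 | 157 | 11.3%
3 | 83 | 6.0%
12 | 75 | 5.4%
8 | 69 | 5.0%
9 | 57 | 4.1%
2 | 55 | 4.0%
11 | 50 | 3.6%
Other values (2) | 72 | 5.2%

Value | Count | Frequency (%)
1 | 157 | 11.3%
2 | 55 | 4.0%
3 | 83 | 6.0%
4 | 301 | 21.6%
5 | 31 | 2.2%

Value | Count | Frequency (%)
12 | 75 | 5.4%
11 | 50 | 3.6%
10 | 296 | 21.3%
9 | 57 | 4.1%
8 | 69 | 5.0%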

StartingDate_day
Real number (ℝ≥0)

Distinct: 31
Distinct (%): 2.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 11.69590223
Minimum: 1
Maximum: 31
Zeros: 0
Zeros (%): 0.0%
Memory size: 10.9 KiB

Quantile statistics

Minimum: 1
5-th percentile: 2
Q1: 5
Median: 9
Q3: 18
95-th percentile: 27
Maximum: 31
Range: 30
Interquartile range (IQR): 13

Descriptive statistics

Standard deviation: 8.169445031
Coefficient of variation (CV): 0.6984878012
Kurtosis: -0.718130622
Mean: 11.69590223
Median Absolute Deviation (MAD): 5
Skewness: 0.693338319
Sum: 16269
Variance: 66.73983212
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=31)
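Value | Count | Frequency (%)
6 | 104 | 7.5%
4 | 97 | 7.0%
7 | 95 | 6.8%
5 | 91 | 6.5%
3 | 86 | 6.2%
8 | 84 | 6.0%
9 | 68 | 4.9%
2 | 55 | 4.0%
10 | 55 | 4.0%
1 | 50 | 3.6%
Other values (21) | 606 | 43.6%

Value | Count | Frequency (%)
1 | 50 | 3.6%
2 | 55 | 4.0%
3 | 86 | 6.2%
4 | 97 | 7.0%
5 | 91 | 6.5%

Value | Count | Frequency (%)
31 | 7 | 0.5%
30 | 20 | 1.4%
29 | 19 | 1.4%
28 | 20 | 1.4%
27 | 14 | 1.0%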

EndingDate_year
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 46
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2008.308411
Minimum: 1971
Maximum: 2019
Zeros: 0
Zeros (%): 0.0%
Memory size: 10.9 KiB

Quantile statistics

Minimum: 1971
5-th percentile: 1989
Q1: 2005
Median: 2011
Q3: 2015
95-th percentile: 2017
Maximum: 2019
Range: 48
Interquartile range (IQR): 10

Descriptive statistics

Standard deviation: 8.483445224
Coefficient of variation (CV): 0.004224174523
Kurtosis: 2.204605875
Mean: 2008.308411
Median Absolute Deviation (MAD): 4
Skewness: -1.512720427
Sum: 2793557
Variance: 71.96884287
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=46)
Value | Count | Frequency (%)
2016 | 110 | 7.9%
2015 | 109 | 7.8%
2017 | 101 | 7.3%
2014 | 94 | 6.8%
2013 | 92 | 6.6%
2011 | 85 | 6.1%
2010 | 82 | 5.9%
2009 | 77 | 5.5%
2012 | 76 | 5.5%
2008 | 66 | 4.7%
Other values (36) | 499 | 35.9%

Value | Count | Frequency (%)
1971 | 1 | 0.1%
1972 | 1 | 0.1%
1975 | 1 | 0.1%
1977 | 1 | 0.1%
1978 | 4 | 0.3%

Value | Count | Frequency (%)
2019 | 1 | 0.1%
2018 | 29 | 2.1%
2017 | 101 | 7.3%
2016 | 110 | 7.9%
2015 | 109 | 7.8%

EndingDate_month
Real number (ℝ≥0)

Distinct: 12
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.674335011
Minimum: 1
Maximum: 12
Zeros: 0
Zeros (%): 0.0%
Memory size: 10.9 KiB

Quantile statistics

Minimum: 1
5-th percentile: 2
Q1: 3
Median: 7
Q3: 9
95-th percentile: 12
Maximum: 12
Range: 11
Interquartile range (IQR): 6

Descriptive statistics

Standard deviation: 3.454063789
Coefficient of variation (CV): 0.5175142968
Kurtosis: -1.320565955
Mean: 6.674335011
Median Absolute Deviation (MAD): 3
Skewness: 0.1063500938
Sum: 9284
Variance: 11.93055666
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=12)
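Value | Count | Frequency (%)
3 | 305 | 21.9%
9 | 248 | 17.8%
12 | 199 | 14.3%
6 | 127 | 9.1%
4 | 104 | 7.5%
7 | 84 | 6.0%
8 | 67 | 4.8%
2 | 66 | 4.7%
10 | 60 | 4.3%
1 | 48 | 3.5%
Other values (2) | 83 | 6.0%

Value | Count | Frequency (%)
1 | 48 | 3.5%
2 | 66 | 4.7%
3 | 305 | 21.9%
4 | 104 | 7.5%
5 | 41 | 2.9%

Value | Count | Frequency (%)
12 | 199 | 14.3%
11 | 42 | 3.0%
10 | 60 | 4.3%
9 | 248 | 17.8%
8 | 67 | 4.8%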

EndingDate_day
Real number (ℝ≥0)

Distinct: 31
Distinct (%): 2.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 19.8317757
Minimum: 1
Maximum: 31
Zeros: 0
Zeros (%): 0.0%
Memory size: 10.9 KiB

Quantile statistics

Minimum: 1
5-th percentile: 2
Q1: 15
Median: 23
Q3: 27
95-th percentile: 30
Maximum: 31
Range: 30
Interquartile range (IQR): 12

Descriptive statistics

Standard deviation: 8.737607278
Coefficient of variation (CV): 0.4405862294
Kurtosis: -0.5777478792
Mean: 19.8317757
Median Absolute Deviation (MAD): 5
Skewness: -0.786784183
Sum: 27586
Variance: 76.34578095
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=31)
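Value | Count | Frequency (%)
26 | 112 | 8.1%
25 | 106 | 7.6%
27 | 91 | 6.5%
24 | 75 | 5.4%
28 | 73 | 5.2%
30 | 71 | 5.1%
29 | 70 | 5.0%
23 | 66 | 4.7%
20 | 53 | 3.8%
21 | 52 | 3.7%
Other values (21) | 622 | 44.7%

Value | Count | Frequency (%)
1 | 48 | 3.5%
2 | 28 | 2.0%
3 | 24 | 1.7%
4 | 34 | 2.4%
5 | 19 | 1.4%

Value | Count | Frequency (%)
31 | 45 | 3.2%
30 | 71 | 5.1%
29 | 70 | 5.0%
28 | 73 | 5.2%
27 | 91 | 6.5%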

BroadcastDay
Real number (ℝ≥0)

ZEROS

Distinct: 9
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5.641984184
Minimum: 0
Maximum: 8
Zeros: 105
Zeros (%): 7.5%
Memory size: 10.9 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 4
Median: 7
Q3: 7
95-th percentile: 7
Maximum: 8
Range: 8
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 2.276831719
Coefficient of variation (CV): 0.4035515955
Kurtosis: 0.6935375993
Mean: 5.641984184
Median Absolute Deviation (MAD): 0
Skewness: -1.412040961
Sum: 7848
Variance: 5.183962679
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=9)
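Value | Count | Frequency (%)
7 | 862 | 62.0%
4 | 110 | 7.9%
0 | 105 | 7.5%
3 | 100 | 7.2%
6 | 69 | 5.0%
5 | 54 | 3.9%
1 | 44 | 3.2%
8 | 42 | 3.0%
2 | 5 | 0.4%

Value | Count | Frequency (%)
0 | 105 | 7.5%
1 | 44 | 3.2%
2 | 5 | 0.4%
3 | 100 | 7.2%
4 | 110 | 7.9%

Value | Count | Frequency (%)
8 | 42 | 3.0%
7 | 862 | 62.0%
6 | 69 | 5.0%
5 | 54 | 3.9%
4 | 110 | 7.9%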

normalizedScore
Categorical

HIGH CORRELATION

Distinct: 3
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 10.9 KiB

Value | Count | Frequency (%)
-0.6966864842 | 923 | 66.4%
1.309712894 | 453 | 32.6%
3.316112271 | 15 | 1.1%

Frequencies of value counts

Unique

Unique: 0
Unique (%): 0.0%

Histogram of lengths of the category

Length

Max length: 19
Median length: 19
Mean length: 18.65276779
Min length: 17

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
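
The calculation described above can be sketched in a few lines of NumPy. This is an illustration, not the code the report generator runs; the function name `pearson_r` and the toy data are made up for the example. It uses population covariance and standard deviation, which give the same ratio as the sample versions.

```python
import numpy as np


def pearson_r(x, y) -> float:
    """Covariance of x and y divided by the product of their standard deviations."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    cov = np.mean((x - x.mean()) * (y - y.mean()))  # population covariance
    return float(cov / (x.std() * y.std()))         # population standard deviations


# Hypothetical toy data, not taken from the profiled dataset.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 1.0, 4.0, 3.0, 5.0]

r = pearson_r(x, y)
r_rescaled = pearson_r(x, [3.0 * v + 5.0 for v in y])  # change location and scale of y
print(r, r_rescaled)
```

The second call illustrates the invariance mentioned above: rescaling and shifting one variable leaves r unchanged.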

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
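
A minimal sketch of this rank-based calculation, assuming ties are given their average rank (the default of pandas' `Series.rank`). The function name `spearman_rho` and the toy data are hypothetical.

```python
import pandas as pd


def spearman_rho(x, y) -> float:
    """Pearson correlation computed on the rank variables of x and y."""
    rx = pd.Series(x).rank()  # average ranks for ties
    ry = pd.Series(y).rank()
    cov = ((rx - rx.mean()) * (ry - ry.mean())).mean()
    return float(cov / (rx.std(ddof=0) * ry.std(ddof=0)))


# Hypothetical toy data: y is a nonlinear but strictly increasing function of x.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]

rho = spearman_rho(x, y)
print(rho)
```

Because only the ordering matters, a strictly increasing relationship such as the cubic one here yields ρ = 1 even though it is not linear.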

Kendall's τ

Similar to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
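
The pair-counting definition above corresponds to the variant usually called tau-a. The following is a plain O(n²) sketch of it with a hypothetical function name. Statistical libraries often default to the tie-adjusted tau-b instead, so results can differ when the data contain ties.

```python
from itertools import combinations


def kendall_tau_a(x, y) -> float:
    """(concordant pairs - discordant pairs) / total number of pairs."""
    pairs = list(combinations(range(len(x)), 2))
    concordant = discordant = 0
    for i, j in pairs:
        sign = (x[i] - x[j]) * (y[i] - y[j])
        if sign > 0:
            concordant += 1   # both variables order the pair the same way
        elif sign < 0:
            discordant += 1   # the variables order the pair in opposite ways
        # sign == 0 means a tie in x or y; it counts toward neither group
    return (concordant - discordant) / len(pairs)


# Hypothetical toy data: one swapped pair out of six.
tau = kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4])
print(tau)
```

In the toy example five pairs are concordant and one is discordant, so τ = (5 - 1) / 6.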

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
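
A sketch of a bias-corrected Cramér's V along the lines of Bergsma's 2013 proposal is given below. It is an assumption-laden illustration, not the report generator's implementation: the function name `cramers_v_corrected` is hypothetical, the chi-squared statistic is computed directly without any continuity correction, and the inputs are assumed to be two equal-length sequences of category labels.

```python
import numpy as np
import pandas as pd


def cramers_v_corrected(a, b) -> float:
    """Bias-corrected Cramér's V for two sequences of category labels."""
    table = pd.crosstab(pd.Series(a), pd.Series(b)).to_numpy(dtype=float)
    n = table.sum()
    # Expected counts under independence, from the row and column totals.
    expected = np.outer(table.sum(axis=1), table.sum(axis=0)) / n
    chi2 = ((table - expected) ** 2 / expected).sum()
    r, k = table.shape
    phi2 = chi2 / n
    # Bias correction: shrink phi^2 and the effective table dimensions.
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denom = min(k_corr - 1, r_corr - 1)
    return float(np.sqrt(phi2_corr / denom)) if denom > 0 else 0.0


# Hypothetical toy data: one perfectly associated pair, one independent pair.
labels = ["x", "y"] * 50
v_perfect = cramers_v_corrected(labels, labels)
v_independent = cramers_v_corrected(["x", "x", "y", "y"] * 25, ["u", "v", "u", "v"] * 25)
print(v_perfect, v_independent)
```

With these toy inputs the perfectly associated pair gives a value of 1 and the independent pair gives 0, matching the range described above.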

Missing values

Sample

First rows

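 | df_index | Type | Episodes | Status | StartingSeason | BroadcastTime | Sources | Duration | Rating | Score | ScoredBy | Members | Favorites | StartingDate_year | StartingDate_month | StartingDate_day | EndingDate_year | EndingDate_month | EndingDate_day | BroadcastDay | normalizedScore
0 | 0 | 5 | 64 | 1 | 1 | 59 | 6 | 1536 | 4 | 9 | 719706 | 1176368 | 105387 | 2009 | 4 | 5 | 2010 | 7 | 4 | 4 | 3.316112
1 | 1 | 0 | 1 | 1 | 3 | 81 | 9 | 106 | 3 | 9 | 454969 | 705186 | 33936 | 2016 | 8 | 26 | 2016 | 8 | 26 | 7 | 3.316112
2 | 2 | 5 | 51 | 1 | 1 | 63 | 6 | 1536 | 4 | 9 | 70279 | 194359 | 5597 | 2015 | 4 | 8 | 2016 | 3 | 30 | 8 | 3.316112
3 | 4 | 5 | 24 | 1 | 1 | 26 | 12 | 1536 | 3 | 9 | 552791 | 990419 | 90365 | 2011 | 4 | 6 | 2011 | 9 | 14 | 8 | 3.316112
4 | 5 | 3 | 110 | 1 | 3 | 81 | 8 | 1664 | 4 | 9 | 28452 | 121772 | 8370 | 1988 | 1 | 8 | 1997 | 3 | 17 | 7 | 3.316112
5 | 6 | 5 | 51 | 1 | 1 | 63 | 6 | 1536 | 3 | 9 | 90758 | 212238 | 4533 | 2011 | 4 | 4 | 2012 | 3 | 26 | 1 | 3.316112
6 | 7 | 5 | 148 | 1 | 0 | 57 | 6 | 1472 | 3 | 9 | 395162 | 705225 | 63324 | 2011 | 10 | 2 | 2014 | 9 | 24 | 4 | 3.316112
7 | 8 | 5 | 22 | 1 | 0 | 76 | 6 | 1600 | 3 | 9 | 26284 | 80166 | 1961 | 2017 | 10 | 14 | 2018 | 3 | 31 | 3 | 3.316112
8 | 9 | 5 | 13 | 1 | 0 | 63 | 6 | 1536 | 3 | 9 | 62582 | 121612 | 1498 | 2012 | 10 | 4 | 2013 | 3 | 28 | 5 | 3.316112
9 | 10 | 0 | 1 | 1 | 3 | 81 | 6 | 110 | 3 | 9 | 60295 | 104021 | 1374 | 2013 | 7 | 6 | 2013 | 7 | 6 | 7 | 3.316112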

Last rows

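 | df_index | Type | Episodes | Status | StartingSeason | BroadcastTime | Sources | Duration | Rating | Score | ScoredBy | Members | Favorites | StartingDate_year | StartingDate_month | StartingDate_day | EndingDate_year | EndingDate_month | EndingDate_day | BroadcastDay | normalizedScore
1381 | 1552 | 0 | 1 | 1 | 3 | 81 | 6 | 93 | 0 | 7 | 978 | 2018 | 3 | 2002 | 4 | 20 | 2002 | 4 | 20 | 7 | -0.696686
1382 | 1553 | 5 | 25 | 1 | 0 | 18 | 9 | 1472 | 4 | 7 | 45933 | 107062 | 944 | 2014 | 10 | 5 | 2015 | 3 | 29 | 4 | -0.696686
1383 | 1555 | 5 | 23 | 1 | 0 | 81 | 10 | 1280 | 0 | 7 | 562 | 1464 | 2 | 1988 | 10 | 2 | 1989 | 3 | 26 | 7 | -0.696686
1384 | 1556 | 5 | 12 | 1 | 4 | 15 | 1 | 1536 | 3 | 7 | 17592 | 51297 | 729 | 2007 | 1 | 12 | 2007 | 3 | 30 | 0 | -0.696686
1385 | 1557 | 5 | 26 | 1 | 0 | 65 | 6 | 1600 | 4 | 7 | 2072 | 7405 | 39 | 1999 | 10 | 8 | 2000 | 3 | 31 | 6 | -0.696686
1386 | 1558 | 5 | 12 | 1 | 2 | 81 | 6 | 1536 | 4 | 7 | 171506 | 296985 | 3576 | 2010 | 7 | 2 | 2010 | 9 | 17 | 7 | -0.696686
1387 | 1559 | 3 | 1 | 1 | 3 | 81 | 6 | 1792 | 3 | 7 | 6062 | 12111 | 4 | 2013 | 8 | 6 | 2013 | 8 | 6 | 7 | -0.696686
1388 | 1560 | 0 | 1 | 1 | 3 | 81 | 6 | 95 | 3 | 7 | 61505 | 104288 | 129 | 2009 | 8 | 1 | 2009 | 8 | 1 | 7 | -0.696686
1389 | 1561 | 0 | 1 | 1 | 3 | 81 | 0 | 90 | 3 | 7 | 3054 | 12868 | 12 | 2012 | 6 | 9 | 2012 | 6 | 9 | 7 | -0.696686
1390 | 1562 | 5 | 12 | 1 | 2 | 26 | 9 | 1536 | 4 | 7 | 46334 | 99299 | 330 | 2014 | 7 | 8 | 2014 | 9 | 23 | 6 | -0.696686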